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Abstract 

Contemporary mobile devices are battery powered and due to their shrinking size and increasing complexity operate on a tight 
energy budget. Thus, energy consumption is becoming one of the major concerns regarding the current and upcoming wireless 
communication systems. On the other hand, the available bandwidth resources are limited and modem applications are throughput 
demanding, leading thus to strong competition for the medium. In this direction, we consider a stochastic contention based medium 
access scheme, where the devices may choose to turn off for some time in order to save energy. We perform an analysis for a 
slotted ALOHA scenario and we show that the energy constraints, if properly exploited, may reduce contention for the medium. 
Our results give valuable insights on the energy-throughput tradeoff for any contention based system. 

Keywords: energy saving, contention, ALOHA, game theory 

L Introduction 

As mobile communications become part of our everyday life, new challenges for the system designers come to the foreground. 
First of all, the scarcity of bandwidth resources leads to extreme competition for the medium. Besides, the total energy dissipation 
by communication devices has been shown to amount to a significant portion of a nation's power profile, motivating efforts 
of per device energy economy. In an attempt to minimize their energy footprint and/or maximize the battery lifetime, existing 
wireless devices support radio sleep modes. 

A generic wireless terminal consists of several circuit building blocks with the RF transceiver (radio) contributing significantly 
to the overall energy consumption. The RF transceiver itself consists of four subblocks. The transmit block that is responsible for 
modulation and up-conversion (i.e. transforms the baseband signal to RF), the receive block dedicated to the down-conversion 
and demodulation, the local oscillator that generates the required carrier frequency, and the power amplifier that amplifies the 
signal for transmission. Existing wireless devices support radio sleep modes that turn off specific subblocks, to minimize their 
energy consumption while inactive. For example, as shown in Table |l] the CC2420 transceiver ([4]) provides three different 
low power modes. In the deepest sleep mode, both the oscillator and the voltage regulator are turned off, providing hence the 
lowest current draw. However, this comes at the cost of the highest switching energy cost and the longest switching latency. 
On the other hand, the idle mode provides a quick and energy inexpensive transition back to the active state, but at the cost 
of higher current draw and consequently higher current consumption. 

To address this tradeoff, the authors of |6| propose a scheme to dynamically adjust the power mode according to the traffic 
conditions in the network. They show that in a low traffic scenario, a deep sleep mode should be preferred since most of the 
time the nodes tend to be inactive. In a high traffic setting though, a "lighter" sleep mode is preferable, because frequent mode 
transitions incur high delay and energy costs, exceeding any energy saving coming from the low current draw. 

In this direction, several energy aware MAC protocols have been proposed, either centralized or distributed ones, to resolve 
contention. However, most of them rely on the willingness of the nodes to comply with the protocol rules. Hence, they are 
vulnerable to selfish users that may deviate from the protocol in order to improve their own performance. Game theory comes 
as the ideal tool to model interactions among self-interested entities competing for common resources |7| and it has also been 
considered recently for medium access. 

In [5J, the authors study the Nash Equilibrium Points (NEP) in a slotted ALOHA system of selfish nodes with specific quality- 
of-service requirements. It has been observed that usually selfish behavior in medium access leads to suboptimal performance. 
For example, a prisoners dilemma phenomenon arises among selfish nodes using the generalized slotted Aloha protocols of 
lH). A decrease in system throughput, especially when the workload increases due to the selfish behavior of nodes, is observed 
in m, 13]. In an attempt to mitigate the effects of selfishness, the authors of ^ study the problem of minimizing the energy 
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Fig. 1. The structure of a superframe 

consumption for given throughput demands for a contention MAC. They show that whenever the demands are feasible, there 
exist exactly two Nash equilibrium points and derive a greedy mechanism that always converges to the best one. 

In this paper we introduce an additional level of decision making capturing the ON-OFF strategy of the terminals over the 
classic ALOHA game. Thus, we model contention for the medium as a game, where users with specific energy constraints 
select both the proportion of time that they sleep and their medium access probabilities. In this point we should mention 
that the main characteristics of the slotted ALOHA are also apparent in most contemporary contention-based systems, such 
as the 802.11. For example, all these systems exhibit a certain amount of inherent inefficiency. The throughput breaks down 
significantly, as the number of users and the message burstiness increase. In theory, an ideal CSMA/CA protocol should provide 
the same throughput, independent of the number of radios operating in a frame. In practice though, in order to avoid collisions 
some communication overhead has to be added, making hence the throughput a decreasing function of the number of operating 
radios. For instance, 802.11b achieves a 2 to 4Mbps effective throughput with only a few stations talking versus its maximum 
raw data rate of 5.5Mbps. A very accurate approximation of the actual throughput was derived in |2|. Consequently, our results 
can also be insightful for other contention based systems. 

To the best of our knowledge, this is the first work that addresses the interplay between contention and energy consumption 
for systems that support sleep modes. The contributions of this paper can be summarized in the following: 

• We quantify the interplay between contention and energy saving. 

• We characterize the throughput optimal strategy, given the energy constraints. 

• We develop a distributed approach that captures the notion of proportional to the energy budget fairness. 

• We also formulate contention as a non-cooperative game among self-interested entities. 

• We show that the resulting game has a unique NEP. 

• We show that the energy constraints cause bounded Price of Anarchy. 

• Based on the rationality of the users, we derive an improved alternative strategy and show that it has multiple NEPs. 

II. System model 

We consider a communication scenario, where a set M of mobile terminals, with \N\ — N, wishes to transmit to a common 
destination (e.g. uplink to a Base Station). Time is slotted and within each timeslot each user may select either to transmit or 
to stay silent. Thus, the number of active users within the timeslot t can be modeled as a stochastic process Na{t). We assume 
that medium access is performed probabilistically, according to a slotted ALOHA protocol, where a collision occurs whenever 
two or more terminals transmit concurrently. Each terminal has always buffered packets for transmission (i.e. saturated queue), 
but limited energy resources. Each device i is characterized by an energy budget e^, representing either its available battery 
power or the maximum energy it is willing to pay for. In order to save energy it can turn into a sleep mode, where most of 
the circuits are turned off. For analytical tractability we assume that each terminal may be in one out of two possible states, 
either ON or OFF 

In general, a mode transition incurs significant energy and time (delay) costs. Besides, due to hardware limitations the time 
required for a mode transition is of at least order of msec, much larger than the duration of a timeslot. Consequently, the mode 
transition at the timeslot level is neither feasible nor desirable and hence we introduce a new timescale, which we call frame, 
and where the mode switching takes place. Several timeslots constitute a frame and an arbitrary number of frames, say TV, 
forms a superframe. The duration of the frame is dictated by the time required to obtain convergence to the ALOHA mean 
behaviour, whereas the horizon of operation can be arbitrarily large. As depicted in Fig. [T] the beginning of each frame is a 
decision point, where a node may change its operation mode. Inside any given frame, the nodes keep their mode fixed (either 
ON or OFF). Then, within the frame any active node may access the medium randomly according to a probability p. This 
probability is also assumed fixed on a per frame basis. 

The control of a user k for each superframe j can be represented by a binary vector q^.{j) = {0, 1}^, denoting the ON-OFF 
states, and an access probability p^, i.e. we assume here that a user selects his access probability only once per superframe. 
In practice though, the mobile terminals are capable of making some decisions, but very rarely can exchange enough real 



time information to perform deterministic control and achieve the optimal behavior. A convincing example for using simple 
probabilistic controls is the framework of the 802.11 networks, prevailing in the world of computers, where the access behavior 
is dictated by probabilistic control. Similarly, by design choice we study a probabilistic version of the aforementioned problem 
which can be stated as follows. 

Each user i is characterized by a probability of being ON, denoted with qi and a medium access probability pi. In matrix 
notation the strategy space can be written as I = {p, q}, with p = [pi,P2, ■ ■ -Pn] and q = [qi,q2, . . . qN]. In our setting, the 
throughput of a user i can be defined as the number of successfully exploited slots per unit time, and is a random variable 
with a mean of: 

%{pi,qi) =Piqi Jl (1 --Pj9j) '^^=^^ Jl {i-pjqj) (1) 

Note that the per user throughput is an increasing function of the decision variables pi and qi, but decreasing in the number 
of terminals N contending for the medium. The latter is in compliance with the classic ALOHA, but in practice also holds 
for CSMA/CA protocols. 

The energy cost of user i is a random variable with a mean value of Ei{pi,qi) — qi{ci + C2Pi), where ci is the energy 
consumption of the ON state and C2 the additional cost imposed by the transmission. Obviously, in order to be able to transmit, 
the node has to be in the ON state. Here, we have not considered the actual energy consumption of the transition itself. 

III. The impact of constrained energy resources on the system throughput 

In this general setting, we would like to find the ON-OFF and the medium access probabilities that maximize the collision- 
free utilization of the medium, and consequently the throughput of the system. This can be formally expressed as the following 
optimization problem: 
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Throughout the paper, and without loss of generaUty, we assume that the users are ordered in decreasing energy budget, i.e. 

ei > 62 > . . . > e-N- 

A. Throughput optimal scheduling in energy constrained ALOHA with sleep modes 

In the classic ALOHA setting, where no energy constraints exist, the throughput optimal strategy would be the one that 
eliminates contention. Thus, if we could force only a single user, say user fc, to access the medium with probability pk = 1 in 
each frame, we would achieve the maximum total throughput. In our scenario though, due to the energy constraints, the users 
may not be able to stay continuously ON (i.e qk = 1) or to transmit with pk = 1. Then, what is the best way to exploit the 
available energy resources? For each user we need to find the portion of energy to spend for staying ON during the frames 
and the portion used for transmitting within an ON frame. 

Lemma 1: Out of all the throughput optimal strategies the most energy efficient ones are of the form I* — {l,a} with 
1 = [1, 1, ... 1] and a e [0, 1]^. Thus, without loss of optimality we may restrict the strategy search space only to strategies 
where the nodes transmit continuously inside any ON frame. 

Proof: Let I be a feasible throughput optimal strategy and aj = Piqi. From eq. [l] we may see that throughput depends 
only on the pq products. Thus, the strategy I* — {l,a} with a = [piqi,P2Q2, ■ ■ -PiQi] achieves the optimal throughput, i.e. 

f{i*) ^T(iy 

Then, we prove that I* is also feasible, as the most energy efficient strategy. 

Any other feasible throughput optimal strategy I can be written as an expression of a as {pi — ^ , qi — ai + 6i}, with 
< Si < min{l — a^, 5iiZL?ii^ii±5i?l j. Thus, regarding the energy efficiency we have: 
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This completes our proof. 



Based on Lemma [T\ the optimization problem described by eq. |2] can be simplified to an expression that depends only on 
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q leading to the objective function T{q) — qi Y[ (1 ^ Qj) ™d constraints < Qi < qt — min | ^^'^jj^^ , l|; this can be 
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further simplified into a problem of binary integer programming. 

Lemma 2: The optimal solution is of the form q* = b*diag[(7i, (j2, ■ . ■ qn] where b* is a binary row vector. 
Proof: The partial derivative of the objective function is given by: 
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We have thus shown that the sign of the partial derivative depends only in the parameter — ^ — . This leads to the 
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}, otherwise. 
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Lemma 3: The optimal solution b* is of the form b* = [1, 1, . . . 1, 0, 0, . . . 0]. 

Proof: Assume that b of k ones mixed with zeros is the throughput optimal binary vector. We can construct a new vector 
b G [0, 1]^, which has activated only the first k users and gives identical throughput. To construct such a vector we need to 
move any isolated one, say from position I of the initial vector to a zero position say m with to < by setting bm = Qi- Then 
based on Lemma |2] we may fully activate or deactivate any of these users, getting thus better throughput. If this process is 
repeated some times, we finally end up with a vector of higher throughput. This leads us to a contradiction. ■ 

Based on the aforementioned lemmas we may derive the centralized Algorithm [T] that yields the throughput optimal 
probabilistic strategy and is of linear, in the number of users N, complexity. The main idea behind this algorithm is that 
contention may or may not be beneficial, depending on the energy constraints of the users. Namely, an additional user is useful 
if and only if the energy resources of the already active users are not sufficiently large, leaving thus the medium underutilized. 
An additional user introduces a gain due to the exploitation of the empty frames, but also a loss, due to the collisions whenever 
he is concurrently active within a frame with someone else. If the average gain is greater than the induced loss, it is beneficial 
for the system to be enabled. 

Algorithm 1 Optimal probabilistic frame scheduling 
1: Order users in decreasing e^. Without loss of generality, 

we reassign the indices such that qi > q2 > ■ ■ ■ > qw 
2: q ^0 

3: j ^1 

4: while j < N 

3 

and V — ^ < 1 do 

5: qj i~ qj 
6: j ^j + l 

1: end while 



Theorem 1: Algorithm [T] yields the throughput optimal probabilistic strategy. 
Proof: The optimality of [T] comes directly from Lemmas [T] |2] and |3] 

■ 

Although, this approach maximizes the total throughput, it also introduces coordination and fairness issues; in particular, 
leads to a deterministic frame scheduling mechanism requiring extensive coordination among the users. Besides, it causes 



extremely unfair treatment of the users with low energy budget. Thus, it would not be easily applied to dynamic distributed 
environments as the ones considered here. 



B. A distributed fair algorithm 

As an answer to the aforementioned problems and in an attempt to capture the notion of proportional fairness we may 

N 

substitute the original objective function with the following: U{p,q) = ^^lUilogT;. The multiplicative factor Wi can be 

used to balance the throughput among the users of the system at will. For example, the value Wi — ^ - would allow 
US to split the throughput proportionally to the energy budget of the users. By proper reformulation, the objective function 

N 

can be rewritten as U{p, q) — log [(Pi^i)""' (1 — Piqi)^^% where w^i = wy^ — \ — Wi. This is a separable per user 

function that leads to a fully distributed implementation, requiring minimal or even no information exchange. Actually the only 
information required is the value of the total energy available in the terminals, namely e^, which can also be estimated by 

kefyf 

each user through sensing. The solution of this optimization problem, in accordance to Lemma [Tl is of the form I* = {1, a} 
with a = [min{wi, gi} ,min{w2, 92} , • ■ • , min {w^v, <7Ar}]- 

C. A modified strategy 

Up to here we have assumed that each user makes a decision once for his strategy and applies it forever As a result, user k 
whenever active, transmits with pk = 1, independently of the number of active users within a frame. Thus, whenever two or 
more users select to transmit within a frame they receive zero payoff, but consume energy. Based on these, a rational player 
would be expected to backoff whenever a collision is detected. Although, the terminal is not allowed to switch off in a crowded 
frame, due to the switching time overhead incurred, it may reduce its access probability. This way it would avoid spending 
energy on useless collisions and could utilize these savings for pursuing further contention-free frames. Building on this idea 
we propose the following modified strategy. 

Any active user attempts a transmission within the first timeslot of the current frame. If the transmission succeeds he uses a 
medium access probability of pi = 1. Otherwise he adjusts his strategy, and reduces his transmission probability to pi. It can 
be shown that this new strategy always yields better throughput than the original one. The expressions for the throughput and 
the energy consumption are now respectively given by: 

f. = qA{l- pd n {l-q,)+p, II (1 ~ p,q,) \ (4) 
I jeAfV jeAf\i I 
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Algorithm |2] yields the throughput optimal strategy, by categorizing the users of the system into three groups, namely the 
aggressive, the conservative and the passive ones. The former capture the medium whenever they are active (ON), the second 
transmit only whenever they sense an empty frame and the last do not participate at all. It can be shown that in the optimal 
scheduling there exists at least one aggressive and one conservative user This algorithm is of exponential complexity and will 
be used only for comparison purposes. 

IV. Game Theoretic Approach 

In the previous section we derived probabilistic MAC protocols that require coordination of the actions of the users involved. 
However, in an autonomous setting as the one considered here, it is not necessary that the individuals will comply with the 
rules imposed by the protocol. Actually, at least some of the users may exhibit selfish behaviour and select the strategy that 
maximizes their own utility, namely their individual throughput, at the expense of others. Thus, in this section we model our 
initial problem as a non-cooperative game. 

A non-cooperative game is defined by a set of players, a set of strategies and a metric that indicates the preferences of the 
players over the set of strategies. In our case we have: 

• Players: the N users 

• Strategies: user's i set of feasible medium access and ON-OFF probabilities Xi = {pi.qi : Ei{pi,qi) < Ci and < 

Pl:qi<l} 

• User preferences: represented by a utility function Ui{Ii); peer i prefers strategy li to li iff Ui{Ii) > Ui{Ii). 



Algorithm 2 Modified optimal probabilistic frame scheduling 



1: Search over B = {A^C^V}, i.e. the set of all the possible partitions of M of size 3, 

with \A\ > 1 and |C| > 1 for the throughput optimal assignment: 
2: for all i e ^ do 

3 : {pi,qi} = {1, mini -^^^ , 1 1 } % aggressive ones 

4: end for 

5: for all fc e C do 

6: l/Dfc. qi } = \0. min < — ; — n-^ — ri 1 J' r' % conservative 

7: end for 
8: for all j eV do 
9: {pjjQj} = {0,0} %passive 
10: end for 



For the initial optimization problem (eq.|2ji the utility function of user i is defined as Ui{Ii) — Ti{pi, qi) = piqi Y\j£j\f\i{'^~ 
Pjqj)- By using the KKT conditions we may derive the following lemma: 

Lemma 4: The throughput optimal strategy for user i is = |l, min{ ^^j^^^ , The resulting game has a unique 

Nash Equilibrium Point, described by the strategy X* — {l,q*}, with q* = -^r^^- 

Proof: Since the utility is an increasing function of both pi and qi the energy constraint should be satisfied with 
equality at the optimum. Then, it can be shown through the KKT conditions that the optimal strategy is given by = 
{l'niin{^,l}}. 

Given that the strategy of a user is independent of the actions of the other users and depends only on his own energy 
constraint deriving the resulting NEP is straightforward. ■ 

This can be also justified by the fact that the throughput of user i is an increasing function of qi. Consequently, each 
individual will select its qi so as to satisfy the constraint with equality. On the other hand, since Tj is an increasing function of 
Pi, it would select a transmission probability equal to 1. Thus, we may deduce that at the Nash equilibrium point we receive 
throughput, only when a single user is ON in a frame. Given the NEP of the game we may quantify the performance loss 
arising due to the selfishness of the individuals, by using so called Price of Anarchy (PoA) metric. This is the ratio of the 
value of the objective function at the global optimum to its value at the NEP and in our setting is given by: 

PoA = '^^^ '-^ ^ > 1, (6) 

where S is the set of enabled users at the global optimum. 

Whereas in the classic ALOHA games the PoA is unbounded, in our energy constrained ALOHA setting the PoA is generally 
bounded, since the energy constraints are imposing a fictitious pricing scheme. The only case where the PoA grows unbounded 
is whenever two or more users have infinite power. These two users will involuntarily act as jammers for each other and for 
all the others yielding hence zero system throughput. 

A. The modified strategy as a non-cooperative game of perfect information 

Here, we consider the game arising from the modified strategy. In this new setting, we derive a non-cooperative game of 
perfect information, where in each iteration a user selects the best response to the others strategies. By best response we mean 
that each peer updates his decision variables, so as to maximize its utility function, as a response to the others' actions. 

Theorem 2: The best response strategy of user i is given by: 




Fig. 2. The throughput degradation compared to the optimal 
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The arising modified game has multiple NEPs. 

Proof: Since the utility is an increasing function of pi and qi, the constraint needs to be satisfied with equality. Thus, we 
may replace qi from eq. |5] into the throughput expression, namely eq. |4] Then, the partial derivative of the objective function 
is given by the following expression: 
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The sign of this expression depends only on (1 — Pjqj) and JJ^ (1 — qj). As a result given the actions of the others, 
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the objective function is either a strictly increasing or a strictly decreasing function of pi. Thus, the best response strategy of 
user i is given by |9] ■ 

V. Numerical results 

Here, we perform some simulations to quantify the throughput performance of the proposed schemes. We assume a network 
of = 5 terminals with energy constraints given by e = [30,25, 15, 10,5] and {ci,C2} — {50,70} units. We slightly abuse 
the definition of PoA, by using the modified optimal as the performance benchmark. Thus, the figures depict the performance 
degradation in comparison to the modified optimal. For the modified game we depict the PoA, the price of stability (PoS), 



defined as the throughput ratio of the optimum to the best NEP and the mean performance. Regarding the initial setting, we 
depict the performance degradation of the optimal, the fair and the initial game theoretic scheme. 

Initially, we consider how the energy constraint of the most powerful user affects the performance of the system as a whole. 
As shown in the figure on the left, the additional power budget increases the performance degradation due to the additional 
collisions caused. The system stabilizes for ii — ci + C2, where user 1 has sufficient energy to capture the whole medium on 
his own. On the right one, we depict the impact of the transmission cost C2 on the performance. Here, we see that in a scenario 
of low energy constraints the increased transmission cost makes the users less aggressive, leading thus to reduced collisions. 
Both figures indicate that the modified strategy of backing off when a collision is detected may dramatically improve the 
performance. 

VI. Conclusion 

This work is a first step towards characterizing the energy-delay tradeoff for mobile devices that support sleep modes and 
operate according to contention medium access schemes. We showed that energy constraints indirectly coordinate the actions 
of the players, and thus may reduce contention and lead to better exploitation of the medium. Here, we have assumed non- 
cooperative games of perfect information. However, there are contemporary wireless systems, where each involved entity has 
only a subjective belief on its opponents' strategies. The impact of incomplete information in our setting is an interesting topic 
of future study. 
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